Detection of social speech signals using adaptation of segmental HMMs
نویسندگان
چکیده
This paper proposes an approach to detect social speech signals by computing segmental features using adaptation of segmental Hidden Markov Models (HMMs). This approach uses segmental HMMs and model adaptation techniques such as Maximum Likelihood Linear Regression (MLLR) and Maximum A Posterior (MAP) in order to acquire specific (or adapted) segmental HMMs that are fine-tuned to detect local regions of social signals such as laughter and fillers. Several segmental features are computed on automatically segmented audio with the specific segmental HMMs. Subsequently, the segmental features are used to detect social signals using Support Vector Machines (SVMs). The results indicate that the proposed segmental features play a significant role in detection of social speech signals.
منابع مشابه
EXPERIMENTAL EVALUATION OF SEGMENTAL HMMS - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
The aim of the research described in this paper is to overcome important speech-modeling limitations of conventional hidden Markov models (HMMs), by developing a dynamic segmental HMM which models the changing pattern of speech over the duration of some phoneme-type unit. As a first step towards this goal, a static segmental HMM [3] has been implemented and tested, This model reduces the influe...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملSegmental vocoder-going beyond the phonetic approach
In our paper, the problem of very low bit rate segmental speech coding is addressed. The basic units are found automatically in the training database using temporal decomposition, vector quantization and multigrams. They are modelled by HMMs. The coding is based on recognition and synthesis. In single speaker tests, we obtained intelligible and naturally sounding speech at mean rate of 211.2 b/...
متن کاملReducing computational load in segmental hidden Markov model decoding for speech recognition
Introduction: Research into segment models (SMs) for automatic speech recognition is motivated by limitations of conventional hidden Markov models (HMMs). While HMMs associate states with individual feature vectors, SMs associate states with sequences of vectors (segments) [1], or variable duration acoustic features [2], thereby allowing important static and dynamic structure to be modelled. Gl...
متن کاملProbabilistic-trajectory segmental HMMs
“Segmental hidden Markov models” (SHMMs) are intended to overcome important speech-modelling limitations of the conventional-HMM approach by representing sequences (or segments) of features and incorporating the concept of trajectories to describe how features change over time. A novel feature of the approach presented in this paper is that extra-segmental variability between different examples...
متن کامل